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(57)Abstract: 

PURPOSE: To provide a character recognition device 
which can correctly recognize the characters even when 
an original is set with its vertical direction reversed or the 
front and the back of a transparent original is reversed. 
CONSTITUTION: A character recognition device 
consists of an image reading part 1 which reads the 
image data, a page image storing part 2 which stores the 
image data in a single page, a character image 
segmenting part 3 which segments the image data on a 
single character from the image data on a single page, a 
character image storing part 4 which stores the image 
data on a single character, an image data vertically 
rotating part 5 which vertically rotates the image data by 
180°, an image data upper/rear side turning part 6 which 

reverses the front and the back of the image data, a recognition dictionary 7 which stores the 
feature information on characters, a character recognition part 8 which obtains the candidate 
characters and the resemblance degree of the image data, and an original vertical and 
front/back direction deciding 9 forjudging in the vertical and front/back direction of the 
original. 
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[Title] Character Recognition Apparatus 
[Abstract] 

[Objective] To enable an apparatus recognizing characters on a document page even when the 

sheet of the document page being placed in the wrong top-to-bottom direction or in a wrong side up 
manner. 

[Constitution] The character recognition apparatus is comprised of: 
an image reader unit 1, 

a page-image memory unit 2 for storing an image-data set covering the image of an entire page, 
a character-image extracting unit 3 for extracting from the image-data set covering the image of the 

entire page, a portion correspondent with an individual character, 

a character-image memory unit 4 for storing the image-data portion correspondent with the individual 

character, 

a top-to-bottom rotating unit 5 for rotating 180 degrees the image of the image-data set or image-data 

portion so that the top and bottom of the image is inverted, 

a right and wrong side reversing unit 6 for reversing sides of the page image of the image-data set, 
a recognition dictionary 7 containing information about features associated with respective characters, 
a character recognizing unit 8 for obtaining character candidates in relation to the image data set 

together with respectively associated similarity degrees, 

a document top-end and right-side judging unit 9 for judging top-and-bottom ends and right-and- wrong 

sides of a document. 
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[Scope of claims] 

Claim 1. A character recognition apparatus comprising: 
an image reader unit for acquiring image data, 

a page-image memory unit for storing an image-data set acquired by the image reader unit and 
covering the image of an entire page, 

a character-image extracting unit for extracting from the image-data set that covers the image of the 
entire page, a portion correspondent with an individual character, 

a character-image memory unit for storing the image-data portion, which is extracted by the 
character-image extracting unit and correspondent with the individual character, 

a top-to-bottom rotating unit for rotating 180 degrees the^ entire-page image of the image-data set 
stored in the page-image memory unit or the individual-character image of the image-data portion stored 
in the character-image memory unit so that the top and bottom of either image is inverted, 

a right and wrong side reversing unit for reversing sides of the entire-page image of the image-data set 
stored in the page-image memory unit or sides of the individual-character image of the image-data portion 
stored in the character-image memory unit, 

a recognition dictionary containing information about features associated with respective characters, 

a character recognizing unit for obtaining character candidates correspondent with the image-data 
portion stored in the character-image memory unit together with similarity degrees associated with the 
character candidates respectively by making reference to the information, which is about the features 
associated with respective characters and contained in the recognition dictionary, and 

a document top-end and right-side judging unit for judging top-and-bottom ends and right-and-wrong 
sides of a document based on the character candidates and associated similarity degrees obtained by the 
character recognizing unit. 

[Detailed explanation of invention] 
[0001] 

[Field of industrial application] The invention is concerned with a character recognition 

apparatus for converting an acquired set of image data into a set of character data, and in particular, with a 
pre-process performed by the character recognition apparatus. 
[0002] 

[Prior art technology] Character recognition apparatuses of this kind and associated with prior 

art, each acquires printed, dot-matrix or handwritten characters contained in a newspaper, magazine, 
report or the like by means of an optical image-reader unit such as a scanner as a set of image data and 
outputs them in a set of coded data, wherein the image data is divided into portions respectively 
representing individual characters and converted into respectively corresponding character codes in 
compliant with a standardized rule such as JIS (Japanese Industrial Standards). 
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[0003] 

[Problem to be solved] A character recognition apparatus according to prior art is unable to 

perform a character recognition process correctly if a subject document page is placed on it wrongly with 
respect to the top-to-bottom end direction. When in a situation in which the subject document page is of a 
transparent sheet such as an OHP sheet, it is necessary to place it correctly with respect to both the 
top-to-bottom end and right-side facing directions for obtaining a successful recognition result. 
[0004] Present invention resolves this problem associated with prior art. The present invention makes 
it the objective to provide a character recognition apparatus that can perform a character recognition 
process correctly either in a case in which a subject document page is placed wrongly in terms of the 
top-to-bottom end direction or in a case in which a transparent sheet constituting a subject document is 
placed in a wrongly faced manner on the character recognition apparatus. 
[0005] 

[Means for solving problem] The character recognition apparatus according to the present invention, in 

order for achieving the above-mentioned objective, comprises, further to those constituting a prior art 

character recognition apparatus: 

a top-to-bottom rotating unit for rotating 180 degrees the image of an image-data set so that the top 

and bottom ends of the image is inverted, 

a right and wrong side reversing unit for reversing sides of the image of an image-data set, and 

a document top-end and right-side judging unit for judging top-and-bottom ends and right-and-wrong 

sides of a document. 

[0006] 

[Benefit] Work load to a user placing subject documents to the character recognition apparatus reduces 
according to the present invention, because it is unnecessary to be specific about the top-to-bottom end 
direction when placing a subject document sheet, and further, if the subject document sheet is constituted 
by a transparent sheet, it is unnecessary to be specific about the right-or-wrong-side facing direction of 
the sheet, either. 
[0007] 

[Embodiment] An embodiment of the present invention is described below with references made to 
drawings. Fig.l is a diagram of function blocks associated with a character recognition apparatus 
representing an embodiment of the present invention. In Fig.l, 1 indicates the image reader unit, which is 
for acquiring an image data set, 2 indicates the page-image memory unit, which is for storing the image 
data-set acquired by the image reader unit 1 and coveririgthe image of an entire page, 3 indicates the 
character-image extracting unit, which is for extracting the image-data portion that is correspondent with 
an individual character from the image-data set that is acquired by the image reader unit 1 and covers the 
image of an entire page, 4 indicates the character-image data memory unit, which is for storing the 
image-data portion that is extracted by the character-image extracting unit 3 and is correspondent with an 
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individual character, 5 indicates the top-to-bottom rotating unit, which is for rotating 180 degrees the 
entire-page image of the image-data set stored in the page-image memory unit 2 or the 
individual-character image of the image-data portion stored in the character-image memory unit 4, 6 
indicates the right and wrong side reversing unit, which reverses sides of the entire-page image of the 
image-data set stored in the page-image memory unit 2 or sides of the individual-character image of the 
image-data portion stored in the character-image memory unit 4, 7 indicates the recognition dictionary, 
which contains information about features associated with respective characters, 8 indicates the character 
recognizing unit, which is for obtaining character candidates correspondent with the image-data portion 
stored in the character-image memory unit 4 together with similarity degrees associated respectively with 
the character candidates by making reference to the information, which is about the features associated 
with respective characters and contained in the recognition dictionary 7, and 9 indicates the document 
top-end and right-side judging unit for judging top-and-bottom ends and right-and-wrong sides of a 
document based on the character candidates and associated similarity degrees obtained by the character 
recognizing unit 8. 

[0008] Fig.2 is a diagram of circuit blocks constituting the character recognition apparatus of a present 
invention embodiment. In Fig.2, 10 indicates a random-access-memory (hereafter called RAM), 11 
indicates a read-only-memory (hereafter called ROM), 12 indicates a central processing unit (hereafter 
called CPU) and 13 indicates a scanner. ~ 

[0009] Circuit blocks are employed to implement respective units shown in Fig.l as follows. The 
image reader unit 1 is implemented by a scanner 13, the page-image memory unit 2 and character-image 
memory unit 4 are implemented by a RAM 10, the character-image extracting unit 3, the top-to-bottom 
rotating unit 5, the right and wrong sides reversing unit 6 and the document top-end and right-side 
judging unit 9 are implemented by operation of a CPU 12 that executes a computer-program stored in the 
ROM 11 and exchanges data with the RAM 10, the recognition dictionary 7 is implemented by the ROM 
11, and the character recognizing unit 8 is implemented by the CPU 12 that executes a computer-program 
stored in the ROM 11 and exchanges data with the ROM 11 and RAM 10. 

[0010] The operation of the character recognition apparatus of a present invention embodiment is as 
described below in a manner associated with the flowchart shown in Fig.3. 

[0011] First, at step A, a user obtains an image-data set using an image reader unit 1. The image-data 
set thus obtained and covering the image of an entire page is stored in a page-image memory unit 2. 
[0012] At step B, a character-image extracting unit 3 extracts an image-data portion correspondent 
with an individual character from the image-data set stofed in the page-image memory unit 2 and 
covering the image of an entire page. The data correspondent with the individual character is stored in a 
character-image memory unit 4. 

[0013] At step C, a top-to-bottom rotating unit 5 generates a rotated-character image-data portion from 
the original image-data portion stored in the character-image memory unit 4. A right and wrong sides 



4 



Our ref: 2000FJ508 
JP-A-h07-28945 



reversing unit 6 generates a side-reversed-character image-data portion from the original image-data 
portion stored in the character-image memory unit 4 arid separately a top-end right-side inverted 
image-data portion from the rotated-character image-data portion. 

[0014] At step D, a character-recognizing unit 8 obtains character candidates and associated similarity 

degrees for the original image-data portion, rotated-character image-data portion, side-reversed-character 

image-data portion and top-end right-side inverted image-data portion, respectively. 

[0015] At step E, the document top-end and right-side judging unit 9 judges whether the character of 

which the associated similarity degree obtained in step D is the highest is an unacceptable character. If it 

is judged to be an unacceptable character, the operation goes back to step B. Otherwise, it proceeds to 

stepF. 

[0016] At step F, the document top-end and right-side judging unit 9 evaluates the image-data portion 
representing the character of which the associated similarity degree obtained in step D is the highest. If 
the image-data portion is the original image-data portion, the operation moves to step K. If it is the 
rotated-character image-data portion, the operation moves to step G. If it is the side-reversed-character 
image-data portion, the operation moves to step I. And if it is the top-end right-side inverted image-data 
portion, the operation moves to steps H and J. 

[0017] At steps G and H, the top-to-bottom rotating unit 5 rotates 180 degrees the entire page image 
defined by the image-data set, which is stored in the page-image memory unit 2 so that the top and 
bottom ends of the associate page image are interchanged. 

[0018] At steps I and J, the right and wrong sides reversing unit 6 reverses the sides of the entire page 
image that is defined by the image-data set stored in the page-image memory unit 2. 
[0019] At step K, the character recognizing unit 8 performs character recognizing processes with 
respect to the image-data set stored in the page-image memory unit 2 and covering an entire page. 
[0020] Here, the manner in which characters are recognized in a present invention embodiment is 
provided more specifically using drawings provided in Figs. 4, 16, 7, 8, 9 and 11. The drawing in Fig.4 is 
an illustration to show how a document image is rotated 180 degrees to interchange its top and bottom 
ends. The drawing in Fig.5 is an illustration to show how a transparent document image is reversed 
between its right and wrong sides. The drawing in Fig.6 illustrates that a transparent document image is 
first rotated 180 degrees to interchange its top and bottom ends and then is reversed between its right and 
wrong sides. The drawing in Fig.7 is an example of a character image defined by an image-data portion 
resulting from a character image extraction process performed over an image-data set, which is obtained 
by a scanner 13 and covers an entire page. The drawing InFig.8 is the top-to-bottom rotated character 
image resulting from 180-degree rotation of the original image shown in Fig.7. The drawing in Fig.9 is 
the side reversed character image resulting from right and wrong side reversal of the original image 
shown in Fig.7. The drawing in Fig. 10 is the top-and-bottom ends right-and-wrong sides inverted 
character image resulting from right and wrong side reversal of the top-to-bottom rotated character image 
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shown in Fig.8. Lastly, Fig.ll shows the result of a character recognition process performed over 
image-data sets respectively representing images shown in Figs.7, 8, 9 and 10. 
[0021] The character recognition performed basing on data contained in the table shown in Fig. 11 
identifies character as the one associated with the highest similarity degree among character 
candidates. In a case in which the character candidate identified as the one associated with the highest 
similarity degree is any of the following characters, the operation returns to a step in which a character 
image is extracted from the image-data set covering an entire page, as any of these characters are 
unacceptable for use in judging the placement direction of a document. 

Characters virtually symmetric about the vertical line at the center. Examples are "A" and "M". 

Characters virtually symmetric about the horizontal line at the center. Examples are "C " and "E w . 

Characters virtually symmetric around the center point. Examples are "Z" and "N". 

Characters of which the rotated variations resemble to other characters. An example is "n" of which the 
rotated variation resembles "u". 

Characters of which the mirror images constitute other characters. Examples are character pairs of 
"b" and "d", and "p" and "q". 

[0022] According to the table shown in Fig.ll, the character recognition, here, has yielded the result 
that "the top-end right-side inverted character image" has the highest similarity degree. Based on this 
result the apparatus judges that the document has been placecTon the scanner 13 wrongly both in the 
top-and-bottom direction and in the side selection. As it is impossible to produce a correct character 
recognition result by performing character recognition over the image-data set of which the correspondent 
page-image is in this state, the image-data set is modified into one to represent the page-image which is 
180-degree rotated and then turned over with respect to the document sides before commencing the 
character recognition. 
[0023] 

[Effect of invention] 

According to the present invention, as it becomes clear from the foregoing, the character recognition 
apparatus is further provided with a top-to-bottom rotating unit for rotating 180 degrees an image 
associated with an image-data set so that the top and bottom ends of the image is converted, a right and 
wrong sides reversing unit for reversing sides of an image associated with an image-data set and a 
document top-end and right-side judging unit for judging top-and-bottom ends and right-and wrong sides 
of a document. Consequently it becomes unnecessary for a user to be specific about the top-to-bottom 
direction in which a subject document sheet is placed, and further about the right-or-wrong-side facing 
direction when if the subject document sheet is constituted by a transparent sheet. 

Work load to a user placing subject documents to the character recognition apparatus reduces in this way. 
[Brief explanation to drawings] 
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[Fig.l] Functional-block diagram of a present invention embodiment, a character recognition apparatus 
[Fig.2] Circuit-block diagram of a present invention embodiment, a character recognition apparatus 
[Fig.3] Flowchart of a present invention embodiment, a character recognition apparatus 
[Fig.4] Schematic illustration of a document being rotated to have its top and bottom ends 
interchanged according to the manner associated with a present invention embodiment 
[Fig.5] Schematic illustration of a document being turned over to have its right and wrong sides 
interchanged according to the manner associated with a present invention embodiment 
[Fig.6] Schematic illustration of a document being rotated to have its top and bottom ends 
interchanged and further turned over to have its right and wrong sides interchanged according to the 
manner associated with a present invention embodiment 

[Fig.7] Image of an original character associated with a present invention embodiment 

[Fig.8] Character-image of the original character of which the top and bottom ends are interchanged in 

association with a present invention embodiment 

[Fig.9] Character-image of the original character of which the right and wrong sides are interchanged 
in association with a present invention embodiment 

[Fig. 10] Character-image of the original character of which the top and bottom ends are interchanged 
and further its right and wrong sides are interchanged in association with a present invention embodiment 
[Fig. 11] Tabular description of the relation between character candidates and associated similarity 
degrees according to the manner associated with a present invention embodiment 
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